Proper Learning of k-term DNF Formulas from Satisfying Assignments
نویسندگان
چکیده
In certain applications there may only be positive samples available to to learn concepts of a class of interest, and this has to be done properly, i. e. the hypothesis space has to coincide with the concept class, and without false positives, i. e. the hypothesis always has be a subset of the real concept (one-sided error). For the well studied class of k-term DNF formulas it has been known that learning is difficult. Unless RP = NP, it is not feasible to learn k-term DNF formulas properly in a distribution-free sense even if both positive and negative samples are available and even if false positives are allowed. This paper constructs an efficient algorithm that for arbitrary fixed k, if samples are drawn from distributions like uniform or q-bounded ones, properly learns the class of k-term DNFs without false positives from positive samples alone with arbitrarily small relative error.
منابع مشابه
Learning DNF by Statistical and Proper Distance Queries Under the Uniform Distribution
We show that s-term DNF formulas can be learned under the uniform distribution in quasi-polynomial time with statistical queries of tolerance Ω(ε/s). The tolerance improves on the known tolerance Ω(ε/s) and is optimal with respect to its dependence on the error parameter ε. We further consider the related model of learning with proper distance queries and show that DNF formulas can be learned u...
متن کاملP-Sufficient Statistics for PAC Learning k-term-DNF Formulas through Enumeration
Working in the framework of PAC-learning theory, we present special statistics for accomplishing in polynomial time proper learning of DNF boolean formulas having a fixed number of monomials. Our statistics turn out to be near sufficient for a large family of distribution laws—that we call butterfly distributions. We develop a theory of most powerful learning for analyzing the performance of le...
متن کاملMonotone DNF Formula That Has a Minimal or Maximal Number of Satisfying Assignments
We consider the following extremal problem: Given three natural numbers n, m and l, what is the monotone DNF formula that has a minimal or maximal number of satisfying assignments over all monotone DNF formulas on n variables with m terms each of length l? We first show that the solution to the minimization problem can be obtained by the Kruskal-Katona theorem developed in extremal set theory. ...
متن کاملA Note on Approximating Inclusion-Exclusion for k-CNF Formulas
The number of satisfying assignments of k-CNF formulas is computed using the inclusion-exclusion formula for sets of clauses. Recently, it was shown that the information on the sets of clauses of size ≤ log k + 2 already uniquely determines the number of satisfying assignments of k-CNF formulas [1]. The proof was, however, only existential and no explicit procedure was presented. In this paper,...
متن کاملOn Learning Random DNF Formulas Under the Uniform Distribution
We study the average-case learnability of DNF formulas in the model of learning from uniformly distributed random examples. We define a natural model of random monotone DNF formulas and give an efficient algorithm which with high probability can learn, for any fixed constant γ > 0, a random t-term monotone DNF for any t = O(n2−γ). We also define a model of random non-monotone DNF and give an ef...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
- Electronic Colloquium on Computational Complexity (ECCC)
دوره 24 شماره
صفحات -
تاریخ انتشار 2017